Estimating evolution of freshness in Internet cache directories under the capture-recapture methodology

نویسندگان

  • Ioannis Anagnostopoulos
  • Christos Anagnostopoulos
  • Dimitrios D. Vergados
چکیده

1 Abstract— In this paper, we describe a new web sampling schema for measuring the evolution of freshness in search engines. The methodology used is the capture-recapture, which is mainly applied for estimating evolution rates in wildlife biological studies. After modifications and amendments necessary for web paradigm application, we conducted three capture-recapture experiments of different duration over the caches of Google and MSN. In parallel, we used a typical sampling scheme, similar to many other web sampling approaches used in the literature, in order to evaluate the robustness of our proposal. The paper provides the implementation details of a web-based capture-recapture model along with its assessment. The results show that through the capture-recapture methodology we are able not only to measure the freshness of the tested search services but also to monitor its evolution over time, with a substantially lower amount of required sampling instances. It was not our intention to compare the performance of Google and MSN. However, through our experiments, we observed that although one sometimes presents better refresh rates than the other, in general both search services have virtually equal capabilities in refreshing their directories and providing new and up-to-date results to their users.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Semi-automatic e-chartering through multi-agent systems and satellite IP networks

Scholarly Contributions [Data Provided by ] Editorial on "qoS and service provisioning for integrated wireless networks" Towards a collaborative ranking mechanism for efficient and personalized internet search service provisioning On the feasibility of applying capturerecapture experiments for web evolution estimations Design and implementation of a VoiceXML-driven wiki application for assistiv...

متن کامل

Estimating the size and evolution of categorised topics in web directories

In this paper a statistical approach for estimating the evolution of categorized web page populations in web directories is proposed. The proposal is based on the capture-recapture method used in wildlife biological studies and it is modified according to the necessary assumptions and amendments for conducting the experiments on the web. During these experiments, web pages are likened to animal...

متن کامل

Estimation of Maternal Mortality Rate in Iran from 2010 to 2014 Using Capture-Recapture Method

Estimation of Maternal Mortality Rate in Iran from 2010 to 2014 Using Capture-Recapture Method Ayat Ahmadi 1, Bahareh Yazdizadeh 2, Alireza Zemestani 3* 1Assistant professor of Epidemiology, Knowledge Utilization Research Center, Tehran University of Medical Sciences, Tehran, Iran 2Associate professor of Epidemiology, Knowledge Utilization Research Center, Tehran University of Medical Science...

متن کامل

Estimation of Road Traffic Mortality in Kurdistan Province, Iran, During 2004-2009, Using Capture-Recapture Method

Background: To reduce traffic injuries in the country, health professionals should have accurate estimates of road traffic deaths. Multiple and sometimes inconsistent statistics presented by organizations in charge create high degree of uncertainty for planners and decision makers. To achieve an accurate estimate, several methods are available. Of them, capture-recapture method ...

متن کامل

A comparison of linear transect and capture recapture methods results in Iranian Jerboa population density and abundance estimation in Mirabad plains, Shahreza

During a period from spring 2008 till fall 2010, Iranian Jerboa population abundance was estimated using distance (linear transect) and capture-recapture methods in the Mirabad plains near Shahreza city in Isfahan Province. In the study period, during the active time of the species except reproduction time, we tried to live-trap, mark, release and recapture individuals based on Schnabel method ...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:
  • Computer Networks

دوره 54  شماره 

صفحات  -

تاریخ انتشار 2010